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SUMMARY 

This  report  is  a  continuation  of  RM— 1399 »  Differential 
games  II :  The  Definition  and  Formulation. 

Its  essential  contents  are:  the  differential  equation 
technique  for  solving  differential  games  (at  least  "in 
the  small")  end  the  Verification  Theorem,  a  device  which 
enables  one  to  prove  that  the  answers  obtained  8re  the 


correct  solutions 
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DIFFERENTIAL  GAMES  Ills  THE  BASIC  PRINCIPLES 
OF  THE  SOLUTION  PROCESS 

by 

Rufus  Isaacs 

(This  work  is  a  continuation  of  RM-1391#  which  we  will  refer  to 
as  II.  ) 

1.  The  Na ture  of  a_  Solution 

When  we  analyse  a  differential  game  we  shall  be  interested 
in  three  goals:  the  optimal  tactics  of  each  player  and  the  value 
of  the  game.  We  shall  say  the  game  hss  a  solution  when  all  three 
items  exist;  collectively  they  are  the  solution.  We  will  have 
solved  the  game  when  either  we  have  found  a  solution  or,  in  case 
none  exists,  when  we  have  become  as  enlightened  as  the  situation 
permits. 

The  term  optimal  tactics  used  above  refers,  as  already  dis¬ 
cussed  in  II,  to  such  (>(x)  and  tf'(x)  as  may  appear  In  all  K-nstrategies 
which  are  £ -strategies  for  all  sufficiently  small  positive  £.  One 
must  bear  in  mind  that  the  utility  of  K-atrategies  is  for  proofs; 
for  practical  purposes  we  are  concerned  with  the  optimal  ^)(x), 
tf'(x)  (henceforth  denoted  by  $(x),  “fp(x))  themselves.  We  sre  not 
far  wrong  if  we  conceive  of  them  directly  as  optimal  strategies 
and  in  the  future  we  will  often  soeak  in  such  terms. 

We  recall  that  the  data  of  a  game  include  8  particular 
starting  point  in  €■  and  that  we  have  used  the  term  "game"  rather 
freely  for  what  should  be  a  family  of  games.  When  we  speak  of 
the  solution  of  a  game  for  r  certain  subset  £’  of  C>  we  will 
refer  to  all  games  with  starting  points  in  . 


RM-1411 

-2- 


Ihe  process  of  solving  a  game  splits  into  two  phases.  It 

generally  turns  out  that  the  region  £  is  to  be  divided  into  a 

number  of  parts  separated  by  surfaces  which  we  shsll  later  call 

* 

singular  sui face3 .  In  each  part  the  solution  will  be  smooth; 
that  is,  V  will  be  of  class  and  the  $  and will  be  continuous 
functions  of  x.  On  the  singular  surfaces  (or  singular  manifolds 
of  lower  dimension)  various  kinds  of  special  behaviour  will  take 
place,  rhere  are  a  number  of  possibilities.  fhe  whole  subject 
of  singular  surfaces  is  difficult  and  sometimes  their  study  is 
the  essence  of  the  problem.  They  shall  be  dealt  with  later. 

We  shall  use  the  term  in  the  small  to  refer  to  the  "smooth" 
parts  of  the  solution  found  between  the  singular  surfaces.  Ihe 
problem  of  identifying  the  singular  surfaces  and  assemblying  the 
"smooth"  parts  into  the  total  solution  will  be  described  by  the 
phrase  in.  the  large. 

We  shall  see  that  the  technique  in  the  small  is  one  of 
differential  equations.  Hi  is  phase  of  the  problem  plays  a  part 
in  full  solution  somewhat  analogous  to  that  played  by  the  Euler 
equations  in  the  calculus  of  variations.  In  fact,  for  some  one 
person  games  the  approaches  become  identical.  ihere  is  nothing 
radically  new  in  the  concept  of  a  singular  surface.  They  appear 
in  some  classical  problems,  but  inconspicuously.  Motives  for  em¬ 
phasizing  them  appear  only  when  we  introduce  the  second  player 


*  Ihere  may  also  be  singular  manifolds  of  dimension  <  n— 1;  such 
of  course  cannot  separate  the  components  of  £. 


1 


1 
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and  look  at  matters  game  theoretically. 

It  is  not  possible  to  make  a  categorical  statement  as  to 
which  of  the  phases  Is  the  more  important.  There  are  examples 
where  the  solutions  in  the  small  are  extremely  simple  (for 
example,  the  paths  msy  be  straight  lines  traversed  at  constant 
speed)  while  the  assortment  of  singular  surfaces  is  abundant 
and  involved.  On  the  other  hand,  there  may  be  few  or  no  singular 
surfaces,  but  we  may  encounter  involved  differential  equations. 

2.  The  Main  Equation 

We  suppose  that  the  value  of  a  differential  game  exists.  It 
will  depend  on  the  starting  point  x  and  we  denote  it  by  V(x).  We 
shall  show  that  v(x)  satisfies  a  first  order  partial  differential 

equation,  to  be  called  the  main  equation,  whenever  v(x)  la  of 

•* ** 

class  , 

For  games  with  integral  payoff  the  main  equation  is 

(1)  min  max  £  Z  V  f.U,^)  +  0(x,^,lp)3  -  0 

t>  4  j  J 

and,  for  games  with  terminal  payoff, 

(2)  .  min  max  Z  Vv  f^x.^tf)  -  0  . 

*  Example  2  of ■ the  following  chapter  will  illustrate  this  point. 

**  'that  is,  V  has  all  it3  first  partisls  and  they  are  continuous. 


i 

i 
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From  the  minimax  assumption,  It  makes  no  difference  if  the 
min  and  max  In  (l)  and  (2)  are  reversed. 

We  shall  give  two  proofs.  The  first  will  follow  immediately. 
It  is  frankly  heuristic.  But  it  is  instructive.  For  example,  it 
is  possible  to  solve  differential  games  approximately  by  discrete 
methods.  Ihe  first  proof  will  point  the  way. 

We  utilize  what  we  have  called  the  tenet  of  transition.  The 
germ  of  the  idea  is  that  we  are  dealing  with  8  family  of  games 
based  cn  different  starting  points.  Let  us  consider  an  interval 
of  time  in  midplay.  At  its  commencement  the  path  has  reached 
some  definite  point  of  €.  We  consider  all  possible  x  which  may 
be  reached  at  the  end  of  the  interval  for  all  possible  navigational 
choices  by  both  players.  We  suppose  that,  for  each  endpoint,  the 
game  beginning  there  has  already  been  solved;  in  other  words,  V 
is  known  there.  Then  the  payoff  resulting  from  each  choice  ^>,  tj' 
during  the  Interval  will  be  known  and  the  navigation  variables 
are  to  be  ao  chosen  as  to  render  it  minimax.  When  we  let  the 
duration  of  the  interval  approach  zero,  the  result  yields  a  dif¬ 
ferential  equation. 

We  couch  the  above  reasoning  formally.  First  let  the  payoff 
be  integral.  Let  V  be  known  at  x  In  £  which  has  been  reached  in 
a  play  at  time  t.  A  3hort  time  later  —  let  us  candidly  label  the 
Interim  dt  -  the  play  has  progressed  to  the  (variable)  point  x'. 
Then 

(3)  (P  *  payoff  at  x  *  )dt  +  V(x') 


RM— 1411 


.  * . 

=  (in  the  language  of  primitive  calculus  ) 

C a(x^,tj/)  +  x  v  f  (x,i^)jdt  +  v(x)  . 

J  XJ  J 

The  <{>  and  ip  in  the  last  line  are,  cf  course,  their  values 
at  x.  To  minimax  (P,  they  must  be  chosen  to  minimax  the  bracket. 
(Their  values  will  then  be  those  of  an  optimal  tactic  at  x. )  But 
when  this  is  done,  che  payoff  will  be  the  value;  thus,  to  make 
the  equation  balance,  the  bracket  must  be  zero.  We  have  (1). 

When  the  payoff  is  terminal,  almost  the  same  reasoning 
applies.  In  this  case  optimal  play  during  the  Interval  means  that 
V  will  be  unchanged  during  it;  we  can  suppress  the  integral  in  (3) 
and  proceed  as  before. 

This  proof  instructs  us  as  to  a  distinction  between  games  with 
the  two  kinds  of  payoff.  Where  the  optimal  path  of  x  is  differen¬ 
tiable,  along  it 

<4)  ~  ]  **{)<*•*#>  ■ 

By  examination  of  the  bracket  above  we  learn: 

When  the  payoff  is  Integral  the  paths  of  optimal  play 
penetrate  the  surfaces  on  which  V  is_  constant;  when  it  is 

*  The  reader  speaking  a  more  sophisticated  tongue  (mean  value 
theorems,  etc.)  can  readily  translate  our  idea  into  terms  of 
greater  rl ^or. 

**  except,  of  course,  when  Q  Is  0  during  an  interval 
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Integral ,  die  pa uia  remain  throughout  the  play  In  the  same  sur¬ 
face  of  constant  V. 

The  second  proof  of  the  main  equation  depends  on  a  new  con¬ 
cept. 


3.  Semipermeable  Surfaces 

We  take  it  thst  each  small  portion  of  the  surfaces  under 
discussion  separate  the  neighboring  space.  As  orientation  is 
germane  to  our  purpose,  we  distinguish  the  two  directions  in 
which  the  surface  may  be  penetrated,  calling  them  the  P—  and 
E-directlons.  The  "side"  of  the  surface  reached  after  penetration 
in  the  P— [E-]  direction  will  be  called  the  P— [E— ]  side.  We  take  a 
point  x  on  a  so  oriented  surface  and  visualize  the  full  vectogram 
at  x.  We  will  say  the  surface  is  semipermeable  at  x  when  the 
following  is  true: 

There  is  at  least  one  value  £  of  £  such  that  if  no 

vector  in  vectogram  penetrates  the  surface  in  the  E-direction. 
Similarly,  there  is  a  if  which  prevents  penetration  in  P-dlrection. 

A  surface  having  this  property  at  each  point  will  be  called 
semipermeable. 

Suppose  the  f^  are  all  separable.  Then  we  observe  that  if 
a  and  ^  there  will  be  strict  penetration  in  the  F-direction. 
It  is  this  fact  that  csuses,  in  the  separable  case,  optimal 

*  ?c  <n  will  be  used  interchangeably  to  denote  a  value  of  *>0] 
with  the  described  property  or  the  set  of  all  such  values. 
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H  strategies  to  be  best  strategies. 

We  have  already  seen  that  we  can  transform  a  game  with 
integral  payoff  into  one  with  terminal  psyoff.  Consider  8n  in¬ 
stance  of  the  latter  which  we  supposed  solved,  and  for  which  V(x) 
has  at  least  two  values. 

Any  surface  which  separates  the  parts  of  £  where  V  >  c  and 
V  <  c  (c,  any  constant)  must  be  semipermeable  with  V  decreasing 
as  the  surface  is  crossed  in  the  P-direction.  For  if,  at  some 
point  x  of  the  surface,  there  were  no  P  could  not  prevent  E 
from  pulling  x  into  the  side  with  the  lsrger  V.  Similarly,  there 
is  a  ip. 

|  ilie  critical  reader  who  demands  more  precise  reasoning  than 

this  will  see  in  the  next  section  how  it  may  be  supplied.  There 
the  reasoning  is  based  on  K-atrategies  which  we  have  accepted  as 
the  sole  thoroughly  rational  bulwark  of  the  theory. . 

Now  suppose  in  a  certain  subregion  cf  Z,  V  is  of  class  C^. 

Ihen  the  surfaces  on  which  V  is  constant  will  be  semipermeable. 

The  vector  grad  V  ■  fv  "l  is  normal  to  such  surfaces.  Whether 

l  X1  /  t 

a  moving  point  penetrates  the  surface  in  one  direction  or  the 

other  or  not  at  all  depends  on  the  sign  of  its  velocity  component 

along  this  vector.  That  is,  in  the  case  where  V  is  of  class  C^, 

the  semipermeability  condition  for  the  surfaces  of  constant  V  is 

min  max  I  V  f.  (x,$>,*p)  »  0 

4>  tp  i  X1 


But  this  is  the  main  equation. 
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Suppose  we  had  begun  with  an  integral  payoff  case  and  per¬ 
formed  the  transformation  in  II  i5.  We  see  that  if  we  compare 
the  games  emanating  from  two  points  which  differ  only  In  their 
values  of  xn+1  (let  d  be  this  difference)  the  games  will  be  Iden¬ 
tical  except  that  there  is  a  difference  d  between  pairs  of  corres¬ 
ponding  payoffs.  Ihus  Vv  **  1.  Also  f^,,  -  G.  Putting  these 

xn+l  n+A 

values  into  ( 2 ),  we  obtain  (1). 


4. 


Ihe  Verificiation  'theorem 


We  do  not  purport  to  give  an  existence  theorem  for  differential 
games.  Our  interest  lies  in  solving  problems.  In  the  sequel  we 
shall  explain  methods  and  exhibit  examples.  But  whst  we  do  need 
is  a  technique  for  showing  that  the  results  of  our  methods  ac- 
1  tually  are  solutions.  Such  is  provided  by  the  subject  theorem. 

It  Is  actually  no  more  than  a  sedulous  application  of  the  semi- 
permeable  surface  concept  of  the  last  section.  Its  advantages 
are,  as  the  reader  will  later  see,  that  It  verifies  our  solution 
methods  almost  automatically.  In  fact,  it  may  be  quite  possible 
to  construct  an  existence  theorem  from  the  solution  technique  and 
the  present  theorem  for  a  properly  delineated  class  of  games. 


Let  us  suppose  that  we  have  found  an  alleged  solution  of  a 
certain  game  with  terminal  payoff.  We  suppose  first  that  it  is 
a  game  of  degree,  reserving  the  other  esse  for  later.  By  this 
we  mean  H  is  a  continuous  function  on  C.  Let  V(x)  be  the  pur¬ 
ported  value.  There  may  be  a  subdivision  of  €.  by  singular  sur¬ 
faces.  Experience  has  shown  that  the  only  types  of  such  surfaces 
on  which  V  fails  to  be  differentiable  are  ones  that  are  never 
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* 

crossed  by  the  optimal  path3.  We  will  suppose  this  tc  be  the 

case  here  and  we  shall  either  remove  any  such  (alleged)  singular 
surfaces  from  £,  or  think  of  them  as  "slits"  so  that  on  the  "cut" 
£  we  may  expect  \  co  be  of  class  C^.  There  may  also  be  a  part  of 
£  with  the  property  that  the  alleged  optimal  paths  beginning  there 
do  not  terminate.  If  so,  we  remove  it  from  £.  What  is  left  of  £ 
after  both  these  kinds  of  surgery  we  will  call  £' .  Generally  it 
fulfills  the  hypothesis  of  the 

Verflcatlon  Theorem.  If  V(x)  is  of  class  in  £' ,  if  it 
satisfies(P)  and  equals  H  on  C,  then  V(x)  is  the  value  of  the 
game,  provided  the  assumptions  made  in  passing  from  £  to  £'  are 
correct.  The  optimal  tactics  consist  for  those  classes  of  func¬ 
tions  of  x  3uch  at  each  position  they  provide  the  min  end  max  in 
(2). 

Proof :  We  can,  as  we  have  seen  in  II,  without  changing  the 
nature  of  the  game,  arrange  that  all  velocity  vectors  in  each 
full  vectogram  have  at  most  unic  length.  We  do  so. 

Let  us  select  a  tactic  $(x)  for  P  such  that  for  each  x,  ^ 
is  minimizing  in  (P).  We  let  E  play  any  K— strategy;  let  t^(x)  be 
Its  tactic.  Play  starts  from  x°.  Given  an  £  >  0,  we  are  going 
tc  complete  P's  K— strategy  by  constructing  so;.  We  ahsll  speak 
83  if  P  were  to  play  indefinitely;  of  course,  we  need  but  curtail 
cur  scheme  when  C  i3  reached. 


*  Example  of  II  is  an  instance.  Although  there  are  infinitely 
many  singular  surfaces,  V  = 
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Divlde  time  into  intervals  Im(m  <  t  <  m  +  1,  m  »  0,  1,  •••)•  In 
1^,  x  cannot  be  further  than  m  +  1  from  the  starting  position  x°. 

We  can  take  it,  that  for  x  so  bounded,  each  V  and  f,  is  uniformly 

J.  .  J 

continuous  in  x,  the  uniformity  holding  for  all  f>,  y.  The  same  is 

true  of  2  V  f .  «  Q(x,  {>,  y).  From  the  bound  on  speed,  we  can 
J  XJ  J 

complete  crt  by  subdividing  Im  into  I  such  that,  during  each,  the 
change  in  0  <  *  The  Tmp  may  furthcr  subdivided  into  Impq 

by  the  t£  of  c*.  In  each  Impq  and  y  are  constant  (-  fcmp  and  tmpq) 
and  so  »  Q.  Let  x'  be  the  position  at  the  start  of  I  and  x 
be  a  point  reached  during  Impq.  Then 

|Q(x,  t>mp,  fmpq)  -  Q(x',  *>mp,  tmpq)l  < 
and,  as  ^mp  satisfies  (2)  at  the  start  of  Imp, 

V  «<x'>  ♦’op'  ^  ’  0 

we  have 

3?  durln*  Lpq  ■  «<x>  *mp’  "W  <  ’ 

As  this  inequality  holds  through  Im,  the  gain  there  of  V  < 
and  so  the  total  gain  <  6.  But  when  C  is  reached,  V  *•  H  ■  payoff 
and  so  the  latter  <  V(x°)  +  £. 

Likewise  we  can  construct  a  C  for  E  insuring  a  payoff  >  V(x°)  —  £  . 

t 

Thus  V(x°)  is  the  value. 

Above  we  have  tacitly  assumed  the  game  will  terminate.  Whether  this 
is  true  is  a  question  of  a  game  cf  degree.  Suppose  that  the  answer 
assures  us  of  termination;  it  generally  happens  that  it  is  occasioned 
by  the  optimal  tactics. 

» 

forward  time  derivative 
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®  Now  let  us  consider  the  singular  surfaces  slashed  from  £  in 

the  construction  of  If  no  velocity  f(x leads  x  onto  such 

a  surface  from  sufficiently  proximate  points. then  the  assumption 
of  the  surface  being  uncrossed  during  optimal  play  is  corroborated. 

Suppose  besides  8  part  £",  in  which  it  was  alleged  that 
optimal  play  did  not  terminate,  was  removed  from  £.  It  may  be 
that  neither  player  can  invoke  termination  from  £"•  In  that  event, 
might  well  have  been  discarded  at  the  outset;  it  has  hardly  a 
legitimate  claim  to  belong  to  £  in  the  first  place.  More  usual  — 
this  being  a  game  whose  essence  is  conflict  —  one  player  will 
strive  for  termination;  the  other  will  oppose  it.  The  boundary 
between  £'  and  £"  will  be  semi permeable;  the  details  fit  into  a 
^  discussion  of  this  subject  still  to  come. 

We  now  turn  to  games  of  kind.  We  already  know  that  our  ob¬ 
jective  here  is  to  divide  €  into  discrete  parts  corresponding  to 
the  discrete  payoffs.  We  can  generally  consider  £"  as  one  of 
these  parts.  This  is  certainly  true  if  we  decree  a  stop  rule 
(see  II,  section  3);  the  case  where  we  don't  will  be  treated  in 
a  subsequent  chapter. 

The  salient  point  is  that  the  boundaries  of  this  subdivision 
are  appropriately  oriented  semlpermeable  surfaces. 

Verification  Theorem  (dames  of  Kind).  If  semlpermeable  sur¬ 
faces  with  continuously  turning  tangent  hyperplanes  exist  in  €  such 
that  they  meet  £  at  the  loci  where  H  changes  values  snd  they  are 
correctly  oriented  (that  is,  the  P-side  corresponds  to  a  payoff 
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lower  then  that  of  the  F— side),  then,  if  these  sui-fsces  separate 
£,  the  value  of  the  game  will  be  constant  in  eech  component  and 
equal  to  the  of  H  on  C  therein.  We  assume  that  for  each  component 
of  £,  except  possibly  £",  the  game  will  terminate. 

Proof:  If  S  is  one  of  these  surface  we  can  imbed  it  in  a 
family  of  semipermeable  surfaces  whose  union  is  a  thin  layer  con¬ 
taining  S.  We  define  a  function  V^(x)  over  the  layer  constant  on 
each  surface,  of  class  C^,  and  decreasing  in  the  P-directlon. 

This  construction  amounts  to  a  problem  in  partial  differential 
equations  and  its  solution  is  known  to  exist.  If  the  reader  de¬ 
mands  an  explicit  construction  we  can  easily  infer  one  from  our 
solution  techniques  to  be  explained  later. 

We  reason  about  this  layer  Just  the  way  we  did  about  £'  in 
the  preceding  proof.  We  find,  say,  that  if  x  is  on  the  P-elde 
of  S,  say  at  a  "distance”  €  from  S,  by  playing  a  suitable  K-atrategy 
P  can  keep  x  from  crossing  S. 

f>.  The  Path  Equations 

We  work  with  integral  payoffs,  the  other  species  being  handled 
by  suppressing  G. 

Being  confronted  with  a  particular  problem  let  us  write  (2) 

and  ascertain  the  maximizing^  and  minimizing  $>  as  functions  of 
* 

the  xi  and  .  Let  them  be 


We  will  henceforth  write  V.  in  place  of  V 
for  the  vector  ■[vj  (-  grad  V). 


V  will  stand 

A 


«  (5)  T-?(X.VX)  ,  *  -  ?(X,VX)  * 

If  there  la  a  choice  involved  in  (5),  select  $  to  be  continuous 
functions  of  their  2n  arguments  insofar  as  it  is  possible.  ( ihe 
last  clause  generally  acquires  a  definite  meaning  in  a  particular 
example.)  Substitute  (5)  into  (2): 

(6)  2  V.  (x)f,(x, $■,{?)  +  0(x,$,ip)  =*  0  . 

i  1 

We  have  now  a  true  partial  differential  equation  for  V  In 
(6).  It  also  shall  be  referred  to  as  the  main  equation. 


We  differentiate  (6)  with  respect  to  each  x^  now  thinking 
of  V  as  a  function  of  x.  Doing  so  in  accordance  with  the  rules 
of  elementary  calculus,  we  examine  the  components  as  they  arise. 
First  we  have 


(7) 


IV  f 

i  xixk 


1 


which  can  also  be  written 

<8> 


that  is,  the  time  derivative  of  over  a  direct  optimal  path. 
Next  we  have 


(9) 


l  Vik 


+  a, 


# 
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We  are  not  really  overworking  these  symbols.  When  V  becomes 
known  as  a  function  of  x  and  so  substituted  in  (5)#  it  is  clear 
that  we  will  have  our  old  ip. 
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ch\ 


\  n 

where  f , ,  (x,u,v)  * **  t — (x,u,v)  and  Q.  =  hen  we  encounter 

0*k  ^  O^k 


(10) 


1  —  (z  v1f,+a) 

4-iAi  4  11  ax,. 


J-l  1 


Each  Is  supposed  subject  to  constant  bounds  such  aa  (10) 
of  II.  The  minimizing  occurs  either  interior  to  the  constraining 
interval  or  at  an  endpoint.  If  the  former,  the  (  )  term  of 

(10)  Is  0  because  of  the  minimizing  property  of  if  the  latter, 

^4 

the  is  0  as  is  constant.  In  either  esse  then  (10)  vanishes. 

dxk  3 

The  same  Is  true  of  the  remaining  terms  devolving  on  the  iK.  We 
conclude 

(11)  vk  -  -I  V1  f  ';k(x,?(x,vx),  f(x,vx))  +  ak(x,f(x,vx),  ^(x,Vx)). 


Rewriting  the  K.E.  ,  slightly  specialized. 


(12) 


K  -  fjx,?(x,vv),  *(x,vj)  . 


This  set  (11),  (12)  of  2n  ordinary  differential  equations  in 
the  2n  unknowns  xk,  Vk  shall  be  called  the  path  equations.  It  is 
not  necessary  to  presuppose  the  existence  of  second  partials  of  V 
such  as  appear  in  (7)  and  (8).  Actually  (11 ),  (12)  are  the  charac¬ 
teristic  equations  of  (6)  (slightly  special  in  thst  the  terms  (10) 
are  nullified).  Solutions  of  (6)  can  be  built  from  integrals  of 
(11),  (12)  in  the  standard  manner,  a  procedure  which  we  shall 

*  We  shall  use  such  obvious  abbreviations  henceforth. 

**  See,  for  example,  Courant— Hilbert ,  Methoden  der  Mathematlsche 
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shortly  adapt  tc  cur  purposes. 

6.  The  Retrogression  Principle 

When  solving  a  game  we  reverse  time;  we  start  at  <?  and  work 
backwards  Into  £.  The  motivation  can  be  easily  understood  if  the 
game  is  quantized. 

Let  us  replace  the  K.E.  by  approximating  difference  equa¬ 
tions.  The  exact  manner  is  not  critical  for  our  present  ends; 
we'll  settle  for  any  reasonable  discrete  facsimile.  Consider 
starting  x  so  near  C?  that  it  can  be  reached  in  one  move  by  each 
player.  What  we  have  here  is  a  one  move  discrete  game;  the  navi¬ 
gation  variables  are  chosen  as  the  optimal  strategies  for  it. 

This  settles  the  value  of  the  game  In  a  certain  thin  part  of 
€  bordering  (?.  Next  we  similarly  investigate  the  starting  points 
from  which  can  be  reached  after  one  move  each  by  the  players. 
Inasmuch  as  values  at  the  end  of  the  composite  move  are  known, 
we  again  can  formulate  matters  as  a  one-move  game.  (What  we  are 
doing  here  is  applying  the  tenet  of  transition;  compare  the  first 
proof  of  the  main  equation  in  Section  2.)  Thus  the  value  becomes 
known  In  a  second  layer  bordering  We  proceed  thus,  filling 

e. 


The  value  of  the  game  is  thus  determined  by  a  chain  of  causes 
and  consequences  that  proceeds  counterchronologically  from  C. 

Accordingly,  we  let  r *  -t  and  use  the  symbol  $  for  ^  , 
that  8  ■  — t.  For  reference,  we  rewrite  the  P.E.: 


so 
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0  * 

7k  =  f  +  Gk^x-W) 

ik  .  -rk(x,f.f)  . 

7.  The  Solution  In  the  Small 

The  surface  C  furnishes  a  natural  seat  of  Initial  conditions 
for  these  retrogressive  equations.  But  an  Important  detail  calls 
for  attention  first. 

Consider  a  position  very  near  C.  One  or  the  other  player 
may  be  able  to  force  or  deter  an  imminent  termination  despite 
any  opposition  from  his  opponent.  Let  V“  (V^,...,Vn)  be  vector 
normal  to  C  from  point  x  on  (5  and  extending  into  £.  If,  say, 

®  (15)  min  max  ZV,f.(x,^)  >  0 

4>  $  i  1  1 

then  E  can  prevent  immediate  termination  from  a  position  sufficiently 
near  x.  If  (15)  holds  with  the  inequality  reversed,  P  can  compel 
immediate  termination. 

There  la  the  question  of  whether  a  player  will  benefit  from 
the  exercise  of  such  power.  Sometimes  the  answer  la  obvious.  We 
cite  the  case  of  termination  time  psyoff ;  clearly  E  will  defer 
termination  whenever  he  can.  But  in  other  instances  E  may  see 
that  avoidance  of  the  frying  pan  now  will  only  lead  to  the  fire 
later.  We  leave  the  intricacies  of  such  questions  to  individual 


<■ 


I 


Ror  terminal  payoff  games,  we  merely  suppress  the  0 


T 


ci  “  **‘l  ^  S1 '  *  *  *  *  sn— 1  ^ 
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cases.  But  we  must  crystallize  one  concept. 


The  above  situation  can  imply  that  only  a  certain  part  of  C 
will  be  effective  under  optimal  play.  We  call  it  the  useable 
Part.  It  is  the  residue  of  (5  when  we  remove  the  portions  where 
one  player  can  -  and  profitably  can  —  forestall  termination  from 
nearby  points. 

Now  we  formulate  the  initial  conditions  for  (13).  (14).  Let 


be  a  parametric  representation  of  C.  On  C,  V  is  known.  Let  it  be 
v( s ) .  If  the  payoff  is  terminal,  V(s)  =  H;  if  integral,  V(s)  «  0. 
We  need  to  know  the  values  of  the  on  <?.  We  have 


(17) 


5^V(a)  '  f  vi  5^  <*ml 


— 1 ,  • .  • ,  n— 1 ) 


which  are  n— 1  equations  for  the  n  unknowns  V^.  The  remaining 
equation  Is  the  main  equation  (6).  We  need  only  solve  this  sys¬ 
tem  for  the  U.P.  of  C.  Sometimes  a  double  solution  will  appear. 
Tine  reason  is  that  there  is  nothing  in  our  analysis  to  distinguish 
the  two  sides  of  0.  It  is  not  hard  to  do  so  by  other  means  and 
then  discard  the  solution  not  pertaining  to  £. 


Thus  the  values  of  xi  and  V  will  be  known  on  the  U.P.  of 
C.  They  are  to  be  employed  as  initial  values  in  integrating  (13), 
(14).  The  solutions  furnish  (reversed)  paths  xi (T(  , . . . , a  ^  ) 

extending  from  the  points  s  of  C  Into  Should  they  fill  £ 
univalently  (exactly  one  path  through  each  point)  the  game  is 
virtually  solved.  _r1he  elements  of  the  solution  can  be  obtained 


; 
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by  routine  calculation  and  ti.e  verification  tneorem  will  8;,ow  it 
is  correct. 

If  £  is  noi  so  filled,  we  can  still  expect  to  obtain  a  solu¬ 
tion  in  part  of  £.  It  will  be  tne  first  step  of  tr»e  solution  in 
tne  large.  As  tne  later  portions  of  t i.e  solution  are  constructed, 
we  may  find  ourselves  repeating  tne  w.ole  procedure  with  new  sur¬ 
faces  playing  the  role  here  allotted  to  C. 

Although  we  wish  to  segregate  the  in— the— large  phases  to 
later  paragraphs,  it  will  be  instructive  to  mention  one  simple 
type  of  singular  surface  now.  Suppose  the  functions  if  of  (5) 
are  not  continuous;  but,  for  example,  $  has  a  simple  discontinuity 
when  a  certain  function  u(x,Vx)  changes  sign.  On  C  we  will  know 
ti.e  sign  of  u  and  can  construct  the  psths  accordingly.  From  these 
solutions  of  (13)  and  (14)  we  will  know  the  value  of  u  along  a 
path.  Let  us  say  that,  for  each  s  (that  is,  path)  there  occurs  a 
value  of  r  where  u  ceases  to  have  the  favorable  sign,  and  all  such 
points  together  constitute  a  surface  O'  in  £.  Thus  our  solution 
construction  is  halted  BtO'. 

But  O'  may  well  be  a  transition  surface,  that  is  one  crossed 

by  the  optimal  paths  but  on  which  at  least  one  of  the  optimal  tactics 

* 

is  discontinuous.  fo  find  out,  we  need  but  proceed  with  the  con¬ 
struction.  We  use  O'  83  b  new  seat  of  initial  conditions  and  on  its 
far  aide  solve  the  path  equations  anew.  These  initial  values  of 

x. ,  V  are  of  course  obtained  from  the  earlier  paths,  but  we  uti— 

1  xi 

lize  for  ^  the  new  value  ascertained  from  the  changed  sign  cf  u. 


*  For  an  Instance  see  Example  3  of  11. 


